An Introduction to Adaptive Critic Control: A Paradigm Based on Approximate Dynamic Programming
نویسنده
چکیده
Adaptive critic control is an advanced control technology developed for nonlinear dynamical systems in recent years. It is based on the idea of approximate dynamic programming. Dynamic programming was introduced by Bellman in the 1950’s for solving optimal control problems of nonlinear dynamical systems. Due to its high computational complexity, applications of dynamic programming have been limited to simple and small problems. The key step in finding approximate solutions to dynamic programming is to estimate the cost function in dynamic programming. The optimal control signal can then be determined by minimizing (or maximizing) the cost function. Due to their universal approximation capability, artificial neural networks are often used to represent the cost function in dynamic programming. The implementation of approximate dynamic programming usually requires the use of three modules-Critic, Model, and Action. These three modules perform the function of evaluation, prediction, and decision, respectively. This article introduces some basic algorithms of adaptive critic control and some recent development of the area. It will also outline some future perspectives of this new control technology.
منابع مشابه
A New Hybrid Critic-training Method for Approximate Dynamic Programming
A variety of methods for developing quasi-optimal intelligent control systems using reinforcement learning techniques based on adaptive critics have appeared in recent years. This paper reviews the family of approximate dynamic programming techniques based on adaptive critic methods and introduces a new hybrid critic training method.
متن کاملImproving quasi-optimal inventory and transportation policies using adaptive critic based approximate dynamic programming
We demonstrate the possibility of optimal control of physical inventory systems in a nonstationary fitness terrain, based on the combined application of evolutionary search and adaptive critic terrain following. We show that adaptive critic based approximate dynamic programming techniques based on plant-controller Jacobeans can be used with systems characterized by discrete valued states and co...
متن کاملPartial, noisy and qualitative models for adaptive critic based neurocontrol
The roles of plant models in adaptive critic methods for approximate dynamic programming are considered, with primary focus given to the DHP methodology. In place of complete system identification, partial, approximate, and qualitative models of plant dynamics are considered. Such models are found to be sufficient for successful controller design. As classification is in general easier than reg...
متن کاملAdaptive critic based approximate dynamic programming: A new tool for smart manufacturing
This work supported in part by the National Science Foundation under grant ECS-9904378. Abstract Adaptive critic based approximate dynamic programming techniques are gradient based methods for finding optimal policies for multi-stage decision processes. We believe adaptive critic methods are now developed to the point that they can be applied to the full spectrum of decision and control problem...
متن کاملStochastic Control Strategies and Adaptive Critic Methods
Adaptive critic methods have common roots as generalizations of dynamic programming for neural reinforcement learning approaches. Since they approximate the dynamic programming solutions, they are potentially suitable for learning in noisy, nonlinear and nonstationary environments. In this study, a novel probabilistic dual heuristic programming (DHP) based adaptive critic controller is proposed...
متن کامل